Enabling Reproducible Science with VisTrails

نویسندگان

  • David Koop
  • Juliana Freire
  • Cláudio T. Silva
چکیده

With the increasing amount of data and use of computation in science, software has become an important component in many different domains. Computing is now being used more often and in more aspects of scientific work including data acquisition, simulation, analysis, and visualization. To ensure reproducibility, it is important to capture the different computational processes used as well as their executions. VisTrails is an open-source scientific workflow system for data analysis and visualization that seeks to address the problem of integrating varied tools as well as automatically documenting the methods and parameters employed. Growing from a specific project need to supporting a wide array of users required close collaborations in addition to new research ideas to design a usable and efficient system. The VisTrails project now includes standard software processes like unit testing and developer documentation while serving as a base for further research. In this paper, we describe how VisTrails has developed and how our efforts in structuring and advertising the system have contributed to its adoption in many domains.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Facilitating Reproducible Computing via Scientific Workflows -- an Integrated System Approach

Author: Cao, Yuan. MS Institution: Purdue University Degree Received: May 2017 Title: Facilitating Reproducible Computing via Scientific Workflows -An Integrated System Approach Major Professor: Yao Liang Reproducible computing and research are of great importance for scientific investigation in any discipline. This thesis presents a general approach to provenance in the context of workflows fo...

متن کامل

Editorial : Scientific Workflows , Provenance and Their Applications

Scientific workflows play a crucial role in modern eScience [5] where many significant scientific discoveries are achieved through complex and distributed computations. For many scientists in the Life Sciences, in bioinformatics, geosciences, chemistry, physics, and numerous other domains, scientific workflows have become an enabling technology to formalize and automate complex and data intensi...

متن کامل

Tackling the Provenance Challenge one layer at a time

VisTrails is a new workflow and provenance management system that provides support for scientific data exploration and visualization. Whereas workflows have been traditionally used to automate repetitive tasks, for applications that are exploratory in nature, change is the norm. VisTrails uses a new change-based provenance mechanism which was designed to handle rapidly-evolving workflows. It un...

متن کامل

A Customized Python module for Cfd flow

R esearchers in the open source community are steadily improving scientific visualization tools. These new tools are providing a wider array of sophisticated probes for data analysis and a wider assortment of effective user-friendly interfaces. They’re also making it easier for researchers in the computational science community—across many disciplines—to effectively analyze huge datasets by dra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:
  • CoRR

دوره abs/1309.1784  شماره 

صفحات  -

تاریخ انتشار 2013